Reducing Queuing Stalls Caused by Data Prefetching

Authors

  • Rao Fu
  • Antonia Zhai
  • Pen-Chung Yew
  • Wei-Chung Hsu
  • Jiwei Lu
Abstract

Data prefetching is an effective way to hide memory latency. Current microprocessors allow multiple prefetch requests to overlap and provide a queue structure to track the outstanding requests. When the queue has no room, a prefetch request can either wait or be dropped. Recognizing the effectiveness and accuracy of compiler-generated prefetches, recent architectures make the choice between stalling and dropping visible to the compiler. Although such designs prove beneficial, they also expose a new kind of stall, the queue full stall. A large percentage of queue full stalls, as high as 50%, can be observed for SPEC CPU2000 on both UltraSparc IV+ and Itanium 2. In this paper, we explain the causes of queue full stalls. We propose three approaches for reducing them (prefetch scheduling, selectively dropping prefetches, and using nontemporal locality hints) and show their performance potential for both SPEC CPU2000 and SPEC CPU2006 [10]. By implementing the third approach in a dynamic binary optimization framework on Itanium 2, we improve the performance of a set of kernel and SPEC CFP2000 programs by as much as 16.5%.
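The stall-versus-drop choice described in the abstract can be illustrated with a toy model. The sketch below is not the paper's method or any real hardware model: the function name `simulate_prefetch_queue`, its parameters, and the fixed memory latency are all assumptions made for illustration. It only shows how a full queue turns into either stall cycles or dropped prefetches.

```python
from collections import deque


def simulate_prefetch_queue(issue_cycles, queue_size, latency, policy):
    """Toy model of a prefetch request queue (hypothetical, for illustration).

    issue_cycles: ascending cycles at which prefetches would issue if
                  the program never stalled.
    queue_size:   number of outstanding prefetches the queue can track.
    latency:      assumed fixed cycles until a prefetch completes.
    policy:       "stall" (wait for a free slot) or "drop" (discard).

    Returns (stall_cycles, dropped_prefetches).
    """
    outstanding = deque()  # completion times of in-flight prefetches
    stall_cycles = 0       # every stall delays all later issues
    dropped = 0

    for t in issue_cycles:
        now = t + stall_cycles
        # Retire prefetches that have already completed.
        while outstanding and outstanding[0] <= now:
            outstanding.popleft()

        if len(outstanding) >= queue_size:
            if policy == "drop":
                dropped += 1
                continue
            # Queue full stall: wait until the oldest request completes.
            wait = outstanding[0] - now
            stall_cycles += wait
            now += wait
            outstanding.popleft()

        outstanding.append(now + latency)

    return stall_cycles, dropped


if __name__ == "__main__":
    burst = list(range(8))  # one prefetch per cycle for 8 cycles
    print(simulate_prefetch_queue(burst, 4, 10, "stall"))
    print(simulate_prefetch_queue(burst, 4, 10, "drop"))
```

In this model a burst of prefetches that exceeds the queue capacity either delays the program (stall policy) or loses prefetches (drop policy). Spacing the issue cycles further apart, which is loosely what prefetch scheduling aims for, reduces both effects.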

Similar resources

Performance Oriented Prefetching Enhancements Using Commit Stalls

Loads that miss in L1 or L2 caches, and are waiting for their data at the head of the ROB, cause significant slowdown in the form of commit stalls. We identify that most of these commit stalls are caused by a small set of loads, referred to as LIMCOS (Loads Incurring Majority of COmmit Stalls). We propose simple history-based classifiers that track commit stalls suffered by loads to help us id...

Software Data Prefetching for Software Pipelined Loops

This paper focuses on the interaction between software prefetching (both binding and nonbinding prefetch) and software pipelining for statically-scheduled machines. First, it is shown that evaluating software pipelined schedules without considering memory effects can be rather inaccurate due to stalls caused by dependences with memory instructions (even if a lockup-free cache is considered). It...

Energy-Constrained Prefetching Optimization in Embedded Applications

In energy-constrained settings, most low-power compiler optimization techniques minimize energy consumption while incurring no performance loss. However, it is possible that the available energy budget is not sufficient to meet the optimal performance objective. To limit energy consumption within a given energy budget, energy-constrained optimization approach is more si...

DSTRIDE: Data-Cache Miss-Address-Based Stride Prefetching Scheme for Multimedia Processors

Prefetching reduces cache miss latency by moving data up in the memory hierarchy before they are actually needed. Recent hardware-based stride prefetching techniques mostly rely on processor pipeline information (e.g. the program counter and branch prediction table) for prediction. Continuing developments in processor microarchitecture drastically change core pipeline design and require that existi...

Streaming Prefetch

In most commercial processors, data prefetching has been disregarded as a potentially effective solution to hide cache misses, multi-level caches being widely preferred. However, multi-level caches are mostly effective at removing capacity and conflict misses, while prefetching is particularly efficient for removing compulsory misses, especially in the regular accesses found in numerical codes. One ...

Publication date: 2007